3,778 research outputs found
Efficient estimation of AUC in a sliding window
In many applications, monitoring area under the ROC curve (AUC) in a sliding
window over a data stream is a natural way of detecting changes in the system.
The drawback is that computing AUC in a sliding window is expensive, especially
if the window size is large and the data flow is significant.
In this paper we propose a scheme for maintaining an approximate AUC in a
sliding window of length . More specifically, we propose an algorithm that,
given , estimates AUC within , and can maintain this
estimate in time, per update, as the window slides.
This provides a speed-up over the exact computation of AUC, which requires
time, per update. The speed-up becomes more significant as the size of
the window increases. Our estimate is based on grouping the data points
together, and using these groups to calculate AUC. The grouping is designed
carefully such that () the groups are small enough, so that the error stays
small, () the number of groups is small, so that enumerating them is not
expensive, and () the definition is flexible enough so that we can
maintain the groups efficiently.
Our experimental evaluation demonstrates that the average approximation error
in practice is much smaller than the approximation guarantee ,
and that we can achieve significant speed-ups with only a modest sacrifice in
accuracy
A tool for subjective and interactive visual data exploration
We present SIDE, a tool for Subjective and Interactive Visual Data Exploration, which lets users explore high dimensional data via subjectively informative 2D data visualizations. Many existing visual analytics tools are either restricted to specific problems and domains or they aim to find visualizations that align with user’s belief about the data. In contrast, our generic tool computes data visualizations that are surprising given a user’s current understanding of the data. The user’s belief state is represented as a set of projection tiles. Hence, this user-awareness offers users an efficient way to interactively explore yet-unknown features of complex high dimensional datasets
An optical fibre dynamic instrumented palpation sensor for the characterisation of biological tissue
AbstractThe diagnosis of prostate cancer using invasive techniques (such as biopsy and blood tests for prostate-specific antigen) and non-invasive techniques (such as digital rectal examination and trans-rectal ultrasonography) may be enhanced by using an additional dynamic instrumented palpation approach to prostate tissue classification. A dynamically actuated membrane sensor/actuator has been developed that incorporates an optical fibre Fabry–Pérot interferometer to record the displacement of the membrane when it is pressed on to different tissue samples. The membrane sensor was tested on a silicon elastomer prostate model with enlarged and stiffer material on one side to simulate early stage prostate cancer. The interferometer measurement was found to have high dynamic range and accuracy, with a minimum displacement resolution of ±0.4μm over a 721μm measurement range. The dynamic response of the membrane sensor when applied to different tissue types changed depending on the stiffness of the tissue being measured. This demonstrates the feasibility of an optically tracked dynamic palpation technique for classifying tissue type based on the dynamic response of the sensor/actuator
Dear Wife : the Civil War letters of Chester K. Leach
Occasional paper (University of Vermont. Center for Research on Vermont) ; no. 20
Bioinformatics tools in predictive ecology: Applications to fisheries
This article is made available throught the Brunel Open Access Publishing Fund - Copygith @ 2012 Tucker et al.There has been a huge effort in the advancement of analytical techniques for molecular biological data over the past decade. This has led to many novel algorithms that are specialized to deal with data associated with biological phenomena, such as gene expression and protein interactions. In contrast, ecological data analysis has remained focused to some degree on off-the-shelf statistical techniques though this is starting to change with the adoption of state-of-the-art methods, where few assumptions can be made about the data and a more explorative approach is required, for example, through the use of Bayesian networks. In this paper, some novel bioinformatics tools for microarray data are discussed along with their ‘crossover potential’ with an application to fisheries data. In particular, a focus is made on the development of models that identify functionally equivalent species in different fish communities with the aim of predicting functional collapse
- …